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Abstract 



Primitive Optimality Theory (OTP) (Eisner 
1997a; Albro, 1998), a computational model 



of Optimality Theory (Prince and Smolensky 



1993), employs a finite state machine to repre- 



sent the set of active candidates at each stage 
of an Optimality Theoretic derivation, as well 
as weighted finite state machines to represent 
the constraints themselves. For some purposes, 
however, it would be convenient if the set of 
candidates were limited by some set of crite- 
ria capable of being described only in a higher- 
level grammar formalism, such Context 
Free Grammar, a Context Sensitive Grammar, 
or a Multiple Context Free Grammar ( peki et 
al., 1991). Examples include reduplication and 
phrasal stress models. Here we introduce a 
mechanism for OTP-like Optimality Theory in 
which the constraints remain weighted finite 
state machines, but sets of candidates are repre- 
sented by higher-level grammars. In particular, 
we use multiple context-free grammars to model 



reduplication in the manner of Correspondence 



Theory ( McCarthy and Prince, 1995 ), and de- 
velop an extended version of the Earley Algo- 
rithm ( Ear ley, 1970| ) to apply the constraints to 
a reduplicating candidate set. 

1 Introduction 

The goals of this paper are as follows: 

• To show how finite-state models of Opti- 
mality Theoretic phonology (such as OTP) 
can be extended to deal with non-finite 
state phenomena (such as reduplication) in 
a principled way. 

• To provide an OTP treatment of redupli- 
cation using the standard Correspondence 
Theory account. 



• To extend the Earley chart parsing algo- 
rithm to multiple context free grammars 
(MCFGs). 

The basic idea of this approach is to begin 
with a non-finite-state description of the space 
of acceptable candidates {e.g., candidates with 
some sort of reduplication inherent in them, or 
candidates which are the outputs of a syntac- 
tic grammar), and to repeatedly intersect the 
high-level grammar representing those candi- 
dates with finite state machines representing 
constraints. The intersection operation is one 
of weighted intersection (where only the set of 
lowest-weighted candidates survive) in order to 
model Optimality Theory, and will make use of 
a modified version of the Earley parsing algo- 
rithm. 

There are at least two alternative approaches 
to that which we will propose here: to aban- 
don finite state models altogether and move to 



uniformly higher- level approaches {e.g., Tesar 
(1996)), or to modify finite state models mini- 



mally to allow for (perhaps limited) reduplica- 
tion {e.g., IWalther (2000| )). The first of these 
alternative approaches deals with context free 
grammars alone, so it would not be able to 
model reduplicative effects. Besides this, it 
seems preferable to stick with finite-state ap- 
proaches as far as possible, because phonolog- 
ical effects beyond the finite state seem quite 
rare. The second of these approaches seems rea- 
sonable in itself, but it is not suited for the 
type of analyses for which the approach laid 
out here is designed. In particular, Walther's 
approach is tied to One-Level Phonology, a the- 
ory which limits itself to surface-true generaliza- 
tions, whereas the approach here is designed to 
model Optimality Theory — a system with vio- 
lable constraints — and in particular Correspon- 
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dence Theory. Tesar's approach as well, while 
it is a model of Optimality Theory, does not 
seem suited to Correspondence Theory. A final 
argument for using this approach, in preference 
to one similar to Walther's approach, is that it 
can be extended to cover other non-finite-state 
areas of phonology, such as phrasal stress pat- 
terns, with no modification to the basic model. 

2 Quick Overview of OTP 

2.1 Optimality Theory 

Optimality Theory (OT), of which OTP is a 
formalized computational model, is structured 
as follows, with three components: 

1. Gen: a procedure that produces infinite 
surface candidates from an underlying rep- 
resentation (UR) 

2. Con: a set of constraints, defined as func- 
tions from representations to integers 

3. Eval: an evaluation procedure that, in suc- 
cession, winnows out the candidates pro- 
duced by Gen. 

So OT is a theory that deals with potentially in- 
finite sets of phonological representations. The 
OT framework does not by itself specify the 
character of these representations, however. 

2.2 Primitive Optimality Theory 
(OTP) 

The components of OT, as modeled by OTP 
(see [ Eisner (1997a| ), [Eisner (1997b|) , |Albro| 
p9980 )^ 



1. Gen: a procedure that produces from an 
underlying representation a finite state ma- 
chine that represents all possible surface 
candidates that contain that UR (always 
an infinite set) 

2. Con: a set of constraints definable in 
a restricted formalism — internally repre- 
sented as Weighted Deterministic Finite 
Automata (WDFAs) which accept any 
string in the representational alphabet. 
The weights correspond to constraint vio- 
lations. The weights passed through when 
accepting a string are the violations in- 
curred by that string. 
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Figure 1: OTP Representation 

3. Eval: the following procedure, where 
/ represents the input FSM pro- 
duced by Gen, and M is a machine 
representing the output set of candi- 
dates: 
M ^ I 

for all Ci € Con, taken in rank order 
do 

M <— intersection of M with Cj 
Remove non-optimal paths from M 
Zero out weights in M 
end for 

Representations in OTP are gestural scores us- 
ing symbols from the set {—,+,[,],!}• See 
Figure Q for an example. This figure shows 
a CVCC syllable in a conventional notation, 
and also in OTP notation. The OTP nota- 
tion is slightly more complex, though, in that 
it also shows an underlying form for the sylla- 
ble. The overlap relation of the conventional 
notation's association lines is expressed in the 
OTP notation by the presence of constituent in- 
teriors ("+") in the same vertical slice through 
the diagram. This same-time-slice-membership 
relation is also used to show correspondence. 
Thus we see from this diagram that the sur- 
face "CVCC" syllable corresponds to underly- 
ing "VCC," and that the initial "C" does not 
correspond to any underlying segment. Note 
that tiers with no special marking are used to 
represent the surface level of representation, and 
underlined tiers are used to represent the under- 
lying level of representation. 

3 Handling Reduplication: Overview 
3.1 Overview 

Finite State Machines are useful in phonology 
because it is possible to take any two finite 
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state machines, each of which represents a set of 
strings, and perform an intersection operation 
on them. The resulting machine represents the 
intersection of the two sets of strings. For exam- 
ple, this allows us to use constraints represented 
as FSMs to limit a candidate set. 

Although we would sometimes like to charac- 
terize the candidate sets using CFGs or MCFGs, 
it must be kept in mind that these formalisms 
do not have the property of being intersectable 
with each other. Thus, in OTP terms, it would 
not be possible to represent the constraints as 
CFGs or MCFGs. However, there is a way out: 
it is possible to intersect an FSM with a CFG 
or an MCFG. 

Based on the above, an approach to handing 
reduplication in phonology becomes clear — we 
start with an MCFG that enforces reduplica- 
tive identity, then intersect it with the input 
FSM (produced by Gen), then the constraint 
FSMs, as before. The hard part, then, is to 
come up with an efficient FSM-intersection al- 
gorithm for MCFGs which also deals correctly 
with weighted FSMs. 

3.2 MCFGs 

A grammar formalism that is midway between 
CFGs and CSGs in expressive power, an MCFG 
is like a CFG except that categories may rewrite 
to tuples of strings instead of rewriting to just 
one string as usual. It should be noted that 
MCFGs have been shown ( |van Vugt, 1996| ) to 
be equivalent to string-valued attribute gram- 
mars with only s-attributes, relational gram- 
mars, and top-down tree-to-string transducers, 
so we could use any one of these grammars to 
provide a candidate space. As an example of 
an MCFG, here's a simple MCFG for the lan- 
guage {wwlw G {0, l}"*"} (the language of total 
reduplication) : 

S ^ AoA, 

A - (1,1) 

I (0,0) 

I (OAo,OAi) 

I (1^0,1^1) 

The nonterminals of this grammar are S, which 
has arity 1, and A, which has arity 2. The right- 
hand sides of the productions include notations 
such as ^40, which indicate the placement of each 
part of the tuple-yield of any category. Here, 




Figure 2: Derivation of "010010" 

and Ai are the two parts of the single category 
A, so a rule like A — > (O^OjO^i) indicates that 
A rewrites to A, with the actual strings ar- 
ranged in a tuple with a preceding the first 
part of A in the first half of the pair, and pre- 
ceding the second part of A in the second half 
of the pair. 

This grammar is in the normal form required 
by the algorithms presented here. This normal 
form can be characterized as follows: 

For any category C of arity greater 
than 1, the category may appear in the 
right hand side of a production only if 
the right hand side refers to each ele- 
ment of C exactly once. 

A derivation of the string "010010" in this 
grammar would go as follows: S rewrites as 
^0^1) that is, to the concatenation of the string- 
yield of the two parts of A. From here, and 
Ai must both come from the parts of a sin- 
gle one of the four productions for A. A then 
rewrites to (O^OiO^i), making, for example, 
the value of Aq in the S production be (0 ^o)- 
A then rewrites to (1 1 ^i), so S reduces to 
01 ^0 01 Ai. Finally, A rewrites to (0,0), leav- 
ing the value of S as 010010. This derivation 
is illustrated in Figure |2|, the left side of which 
depicts the derivation tree, while its right side 
shows (from the bottom up) the string-yield of 
each non-terminal (shown just below and to the 
right of it). 
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3.3 Representation of Reduplicative 
Forms in OTP 

OTP constraints are inherently local — they can 
only refer to overlap or non-overlap of interiors 
or edges in an instant of time. Therefore, to en- 
force correspondences between forms, they must 
be juxtaposed so as to occur in the same time- 
slices. In OTP, correspondence between the sur- 
face and underlying forms is established by us- 
ing one set of tiers for the surface form (each tier 
represents either a feature or a type of prosodic 
constituent) and another corresponding set for 
the underlying form. For example, the tier son 
might specify the distribution of the surface fea- 
ture "sonorant" , while the tier son would specify 
its underlying correspondent. Elements of those 
tiers placed in the same time-slice are consid- 
ered to be in correspondence with one another. 
In order to create correspondence between two 
portions of the same surface form, then, we need 
to somehow have them simultaneously juxta- 
posed so as to appear in the same segments of 
time and separated in time as they will be on 
the surface. This is accomplished by a represen- 
tational trick: in the example of reduplication, a 
copy of the reduplicant's surface form is placed 
in a special set of tiers within the base: 



SL 
UL 
RL 



SL 
UL 
RL 



BASE RED2 

URi UR2 

REDi — 

— or — 

RED2 BASE 

UR2 URi 

— REDi 



In these representations SL stands for the sur- 
face level of representation, UL for the under- 
lying level, and RL for the special rcduplicant 
level (the place where a copy of the rcduplicant 
is kept). URi and UR2 are identical in the 
input, and REDi and RED2 need to be kept 
identical by other means. The means chosen 
here is to use an MCFG enforcing the identity. 
BASE- RED correspondence constraints operate 
upon REDi while templatic and general sur- 
face well-formedness constraints operate upon 
RED2. An example of this sort of representa- 
tion might help here. Suppose that there are 
two surface tiers, C and V. Then a form such 



as [CV-I-CVC] (with CV prefixing reduplication, 
assuming that the base is CVC, and with the 
underlying form RED+/VC/) might be repre- 
sented as follows: 

[ + ] [ + ]- [ + 

[ 

+ ] - 
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Note here that the special BASE and RED tiers 
indicate the portions of the surface forms that 
are the base and rcduplicant, and that the rcdu- 
plicant level of representation (that is, the level 
that holds the copy of the rcduplicant used for 
correspondence) is present on the tiers labeled 
with double underlines. The INS tier represents 
a time-discrepancy between the levels of repre- 
sentation where time does not exist on the un- 
derlying level (so the period of time taken up by 
the initial C in the surface rcduplicant and base 
doesn't correspond to anything in the underly- 
ing level) , and the DEL tier represents time that 
does not exist on the surface level, so the time 
taken up by the final C in the underlying form of 
the rcduplicant does not correspond to anything 
on the surface. The RDEL tier is a mirror of the 
contents of the DEL tier in the surface rcdupli- 
cant, and thus represents time that does not 
exist in the special reference copy of the rcdu- 
plicant. This representation allows us to notice 
that the rcduplicant fits a CV template — the 
left edge of it is aligned with a surface C, the 
right edge with a surface V, and there are no 
other segments within it. (The relevant OTP 
constraints to reinforce this would be "RED[ 
C[," "]RED ^ ]V," "]C ±C[± RED," and "]C 
_L V[ _L RED," if highly ranked and in that or- 
der.) 

In terms of translating these representations 

to finite state machines (or to strings), we use 
the alphabet {—,+,[,],!}, so that each FSM 
edge is labeled with a member of this alphabet. 
This representation difi'ers from that of earlier 
accounts of OTP, in that the FSM edges in those 
accounts represented entire time slices, whereas 
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an edge in this representation represents a single 
tier in a time slice. As an example, the repre- 
sentation of: 



C: 
V: 



is as shown in Figure i where the "C" and "V" 
labels are not part of the representation, but 
just there to ease reading. 

3.4 The Grammar Used 

The grammar used here is a bit complicated, 
but the important thing to note about it is that 
it generates exactly the set of possible OTP out- 
put forms in which the special reduplicant ref- 
erence level of representation contains an exact 
copy of the surface reduplicant, placed within 
the time-duration of the base. The grammar 
for a situation in which there are two surface 
tiers appears in Figure ^. Extending this gram- 
mar to other numbers of tiers is straightforward. 
The constituents of this grammar are as follows: 

S The start symbol. 

Non Non-reduplicating material (such as non- 
reduplicating morphemes) before and/or 
after the reduplicating material. 

SSR The surface tiers in a time-slice. 

UR The underlying tiers in a time-slice. 

MRD The reduplicant reference-level tiers in a 
time-slice where the tiers must contain the 
value — (that is, outside of the base, which 
is the only place where the reduplicant level 
is used). 

Rd/Rdl/Rd2 The reduplicating part of an 
utterance. 

BDR A right-facing boundary (allows any- 
thing to be in the surface tiers during its 
time-slice, and copies the right-facing half 
of that material into the reduplicant). 

BDL A left-facing boundary (see BDR). 

B The surface tiers in a time-slice plus identical 
material in the reduplicant tiers. Thus B 
represents an item in the reduplicant plus 
its copy in the special reduplicant reference 
level. 

The remaining non-terminals define different 
values for the INS, DEL, RDEL, RED, and 



BASE tiers, where INS and DEL are as de- 
fined in Albro (1998| ), RDEL represents time 
that does not exist in the reduplicant, RED rep- 
resents the reduplicant (as a morpheme bound- 
ary), and BASE represents the base as a mor- 
pheme boundary: 

NBR represents the state of not being in the 
base or the reduplicant. 

RLE represents the left edge of the reduplicant. 

RRE represents the right edge of the redupli- 
cant. 

BLE represents the left edge of the base. 

BRE represents the right edge of the base. 

RB represents a boundary between a redupli- 
cant and a base, where the reduplicant 
comes first. 

BR represents the reverse of RB. 

RED represents the inside of the reduplicant. 

BASE represents the inside of the base. 

In this grammar any given time-slice will be de- 
fined as SSR or the first component of one of 
the B categories, followed by UR, followed by 
MRD or the second component of one of the 
B categories, followed by one of the NBR, etc., 
categories. 

4 The Earley Algorithm 

The Earley algorithm is an efficient chart pars- 
ing method. Chart parsing can be seen as a 
method for taking the intersection of a string 
or FSM with a CFG (later, an MCFG). Here 
we take a CFG 4-tuple {V,N,P,S) where 
V represents the set of terminals in the gram- 
mar, N represents the set of non-terminals, P 
represents the set of productions, and S £ N 
is the start symbol. In the definitions to fol- 
low, a, (3, and 7 represent arbitrary members 
of (yuN)*, A and C represent arbitrary mem- 
bers of A, a and h represent arbitrary members 
of V , p represents an arbitrary member of P, 
and the indices i, j, and k represent positions 
within the input string to be parsed, numbered 
as in Figure ^. 

In the standard definition, a member of the 
chart is a 3-tuple (i, C —f a»l3,j), where i repre- 
sents the position at the beginning of the input 
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Figure 3: FSM Representation Used Here 



string covered by a and j represents the posi- 
tion at the end of the covered portion of the 
string. The parsing operation in the standard 
definition, which parses a single input string, 
is defined closure via the following three 

inference rules of a chart initially consisting of 
(0,5^«a,0): 



predict: 



if ^ ^ 7 G P (if 7 begins 



scan: 



with a terminal, that terminal must be the 
symbol at position j in the input string) 

(?,C^a»a/3j) 



complete: 



(i,C— »aai 



The input string is recognized if the chart con- 
tains an element (0, S —>■ n), where n is the 
final position of the input string. 

5 Extending Earley 

The algorithm presented so far just checks to see 
whether a particular string exists in a grammar. 
In order for it to be useful for our purposes, the 
following extensions must be made: 

1. Intersection with an FSM, not just a string 

2. Recovery of intersection grammar 

3. Weights (intersection should allow lowest- 
weight derivations only) 

4. MCFGs 

5.1 Intersection with an FSM 

To modify the algorithm to intersect a grammar 
with an FSM, we replace the input string with 
an FSM, and change our definition of a chart 
entry. Now, a chart entry is a 3-tuple (i, C 
a • where i represents the first FSM state 



covered by a and j represents the last FSM state 
covered. We define an FSM here as a 5-tuple 
{Q,T,,s,F,M), where Q is the set of states in 
the FSM, S is the label alphabet for the FSM 
(for our purposes S is always the same as V 
for all grammars in use), s G Q is the start 
state, F C Q is the set of final states of the 
FSM, and M is a set of 3-tuplcs {i,a,j), which 
represent transitions from state i to state j with 
label a. Given these redefinitions we can then 
just modify the scan rule: 



',3 j+i) " symbol after j scan- 



i!!£Z^+i) (j,a,fc)eM, where M 



is the input FSM. 



and the predict rule in the obvious way: 

7 G P (if 7 is of 



predict: (f^?^ if ^ 



the form a 7', {j,a,k) 
well) 



G M must hold as 



Note that the initial entry in the chart is now 

(s, S s). 

5.2 Grammar Recovery 

It is possible to recover the output of intersec- 
tion by increasing slightly what is in the chart. 
In particular, for every item on the chart, we 
note how it got there (just the last step). Each 
item on the chart may be referred to by its col- 
umn number C and its position N within that 
column. We annotate only items produced by 
scan and complete steps, as follows: 

• sC/N 

• cCi/Ni;C2/N2 

where Ci/Ni refers to the (j, A item 
from the complete step, and C2/N2 refers to the 
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In cases where the reduphcant precedes the base, the 
redupUcation rules will appear as follows: 



Rdl 
Rd2 



BDRo UR MRD RLE Rd2o 

BDLo UR BDRi RB Rd2i 

SSR UR BDLi BRE 

Bo UR MRD RED 

SSR UR Bi 

Rd2o 
Rd2i 



BAS 

Bo UR MRD RED, 
SSR UR Bi BAS 



Otherwise, where the base precedes the reduplicant, the 
rules will appear as follows: 



Rdl 

Rd2 



SSR UR BDRi BLE Rd2o, 
BDRo UR BDLi BR Rd2i 
BDLo UR MRD RRE 

SSR UR Bi BAS, 

Bo UR MRD RED 

Rd2o SSR UR Bi BAS., 

Rd2i Bo UR MRD RED 



Figure 4: Reduplication Grammar 
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Figure 5: Numbering of string positions in the 
string "abc" 

{i,C —> a • Af3,j) item. A chart item is thus 
now a 4-tuple {i,C —>■ a* P, j, H), where if is a 
set of history items of the type described here, 
one for each scan or complete step that put the 
item there. 

Recovery of a grammar then starts from the 
"success items," that is items in the chart that 
begin in state 1 and end with a final state and 
represent a production from the start symbol 
of the grammar, with the Ear ley position dot 
at the end of the production. We then move 
from right to left within those productions, fill- 
ing in the state pairs for each constituent we 
pass, and tracing through their productions as 
well. Whenever we get to the left side of a pro- 
duction, we output it. The exact algorithm is 
as follows: 

Grammar Recovery (c/iart) 

queue ^ [] 

for all success items {s,S — > 7», / G 
F, Ho) at (C, N) do 

queue up (C, N) onto queue 
while queue not empty do 
(C, N) <— dequeue from queue 
item <— item at {C,N): {i,A 
a* J, Hi) 

pos <— pos. of • in item 

RHSs ^ GetRHSs([[]], item, pos, 

queue) 

for all RHS G RHSs do 

output ^ RHS'' 

end for 
end while 
end for 

GetRHSs(r/iss, item, pos, queue) 

if pos = then 

return rhss 
end if 

new_rhss <— [] 

for all history path components hitem 
of item do 

rhss' <— copy rhss 

ext end (r/iss', hitem, pos, queue) 
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add rhss' to new^rhss 
end for 

return new_rhss 

extend(r/iss, hitem, pos, queue) 

if hitem = s(C, A'') then 

prepend scanned symbol to each rhs 
G rhss 

prev ^ item at (C, N) 
else if hitem = c(Ci/A^i; (72/A^2) then 

(i, A 7», j, H) ^ item at (Ci, A^i) 

prepend A{i,j) to each rhs G r/iss 

enter (Ci,A''i) into queue 

prev <— item at (C2, A^2) 
end if 

return GetRHSs(r/tss, item, pos—1, 
queue) 

5.3 Weights 

The basic idea for handhng weights is an adap- 
tion from the Viterbi algorithm, as used for 
chart parsing of probabilistic grammars. Basi- 
cally, we reduce the grammar to allow only the 
lowest-weight derivations from each new cate- 
gory. 

Implementation: Each chart item has an as- 
sociated weight, computed as follows: 

predict: weight of the predicted rule A ^ j 

scan: sum of the weight of the item scanned 
from and the weight of the FSM edge 

scanned across. 

complete: sum of the weights of the two items 
involved 

We build new chart items whenever permitted 
by the rules given in previous sections, assigning 
weights to them by the above considerations. If 
no equivalent item (equivalence ignores weight 
and path to the item) is in the chart, we add 
the item. If an equivalent item is in the chart, 
there are three possible actions, according to the 
weight of the new item: 

1. Higher than the old item: do nothing 
(don't add the new path). 

2. Lower than the old item: remove all other 

paths to the item, add this path to the 
item. Adjust weights of all items built from 
this one downward. 



3. Same as the old item: add the new path to 
the item. 

A chart item is thus now a 5-tuple {w,i,C 
a • P,j,H), where w represents a weight, and 
all the other items are as before. 

5.4 MCFGs 

To extend the Earlcy algorithm to MCFGs, we 
first reduce the chart-building part of the Earley 
algorithm for MCFGs to the already-worked out 
algorithm for CFGs by converting the MCFG 
into a (not-equivalent) CFG. We then mod- 
ify the grammar-recovery step to convert the 
CFG produced into an MCFG, verifying that 
the MCFG produced is a proper one. 

5.4.1 Adjustments to the 
Chart-Building Algorithm: 

First, we treat each part of the rule as a sepa- 
rate rule, and use the regular algorithm. Thus, 

B^^ ^ ^ becomes Bq^ and Bi^ 1. Hav- 
ing separated a single rule such as C ^ (a, f3) 
into two parts Cq ^ a and Ci (3, we need to 
keep track, when building the chart and after, of 
which rule in the associated MCFG each chart 
item refers to. These annotations will be useful 
in Grammar Recovery (something like Co ^ a 
can only be combined with Ci — >■ /3 if they both 
come from the same MCFG rule). Thus, a chart 
item is a 6-tuple (r, w,i,C — > a»P,j, H), where 
r is the rule number from the original MCFG to 
which the production C ^ a • P corresponds, 
and all the others are as before. 

5.4.2 Adjustments to Grammar 
Recovery 

As before, followed by a final combinatory and 
checking step: 

for all non-terminals A with arity n do 
for all possible combinations AQ{i,j) — >• 
jo,Ai{k,l) ^ 71, . . .,Anim,n) 7„ do 
if the MCFG condition applies to the 
combination then 

output A{i, j){k,l) . . . {m,n) 

(70,7l,---,7n) 

end if 
end for 
end for 

where the MCFG condition is as follows: 

All 7j on the right hand side of the 
combination must be derived from the 
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same rule in the original set of rules 
and their yields must not overlap each 
other in the FSM. 

Given the way the chart-parsing and recovery 
algorithms work, the MCFG condition will be 
satisfied if we simply check that all the elements 
of the combination come from the same rule in 
the original MCFG. This will result in some 
invalid rules in the output grammar, but this 
simple check guarantees that these rules will be 
such that they will be unable to participate in 
derivations, since their right-hand sides will re- 
fer to categories that do not head any produc- 
tions. 

5.5 Example 

As an example, let's take the simple reduplica- 
tion grammar from before: 



(1) 

(2) 
(3) 
(4) 
(5) 



S 
A 



Aq Ai 

(1,1) 
(0,0) 

(0^0, OAi) 

(1^0,1^1) 



and intersect it with the machine 




This machine generates the set of strings 
{0|1}+, but weights all strings ending with 0. 

The corresponding CFG-grammar used for 
the chart-building step is as follows: 



(1) 


S 


AqAi 


(2) 


Ao 


1 




Ai 


^ 1 


(3) 


Ao 


^ 




Ai 


^ 


(4) 


Ao 


^ Oylo 




Ai 


^ OAi 


(5) 


Ao 


^ 1^0 




Ai 


lA, 







lirilnTvin 1 i i 




J., t - 




# 


r w 


€ P 






H 





1 


S tAo Ai 









1 


2 


Ao •! 









2 


3 


Ao -0 









3 


4 


Ao .0 Ao 









4 


5 


Ao^»l Ao 









5 


5 


Ao^l* Ao 






{sl/4} 


6 


4 


^0 ^ • Ao 






{sl/3} 


7 


3 


Ao 0. 






{sl/2} 


8 


1 


S^Ao»Ai 










{cl/7;l/0, cl/10;l/0,c 


1/9; 1/0, 


cl/22; 1/0} 


9 


5 


Ao 1 Ao» 










{cl/7; l/5,cl/10; 1/5, f 


1/9; 1/5, 


cl/22; 1/5} 


10 


4 


Ao -^OAo* 










{cl/7; l/6,cl/10; 1/6, c 


1/9; 1/6, 


cl/22; 1/6} 


11 


2 


Ai ^ .1 









12 


3 


Ai -0 









13 


4 


Ai .0 Ai 









14 


5 


Ai •! Ai 









15 


5 


Ai ^l»Ai 






{sl/14} 


16 


4 


^1 ^ 0«yli 






{sl/13} 


17 


3 


Ai 0« 






{sl/12} 


18 


1 


S ^Ao Ai» 










{cl/17; 1/8, cl/20; 1/8, cl/19; 1/8, cl/21; 1/8} 


19 


5 


Ai lAi» 










{cl/17; 1/14, cl/20; 1/14, 


cl/19; 1/14, cl/21; 1/14} 


20 


4 


Ai ^OAi* 










{cl/17; 1/13, cl/20; 1/13, 


cl/19; 1/13, cl/21; 1/13} 


21 


2 


Ai U 






{sl/11} 


22 


2 


Ao ^ 1» 






{sl/1} 



Column 2 (j = 2, i = 1) 



# 




1 
2 
3 

4 

5 

6 
7 
8 
9 

10 

11 

12 
13 



e P 



H 



5 Ao^l»Ao {sl/4} 

4 1 Ao^OtAo {sl/3} 

3 1 Ao^O* {sl/2} 
1 S ^ ^0 • ^1 

{c2/13; 1/0, c2/5; 1/0, c2/4; 1/0} 

5 Ao 1 Ao» 

{c2/13; 1/5, c2/5; 1/5, c2/4; 1/5} 

4 Ao^O Ao» 

{c2/13; 1/6, c2/5; 1/6, c2/4; 1/6} 

5 Ai^l»Ai {sl/14} 

4 1 Ai^QmAi {sl/13} 

3 1 Ai^O» {sl/12} 

1 S^AoAim 

{c2/12; 1/8, c2/ll; 1/8, c2/10; 1/8} 

5 Ai^lAi* 

{c2/12; 1/15, c2/ll; 1/15, c2/10; 1/15} 

4 Ai^OAi* 

{c2/12; 1/16, c2/ll; 1/16, c2/10; 1/16} 

2 Ai^U {sl/11} 
2 Ao^U {sl/1} 



The chart produced by the chart-building 
part of the algorithm is as follows: 



In this chart the items with an empty history 
list were entered by prediction steps. The "suc- 
cess item" for this grammar is then item (2,9): 
{r = l,w = 0,1 = l,p = S ^ Aq Ai»,j = 
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2,H = {c2/12;l/8,c2/ll;l/8,c2/10;l/8}), so 
begin there: 

S{l,2)^Ao Ai 

We then queue up (2,12), (2,11), and (2,10), 
noting that for all of these the states for Ai 
arc (1,2), and we move to item (1,8): (r = 

1, w = 0,i = l,p = S Aq • Ai,j = 1, H = 
{cl/7; l/0,cl/10; 1/0, cl/9; 1/0, cl/22; 1/0}). 
Here we queue up (1,7), (1,10), (1,9), and 
(1,22), noting that for all of these the states for 
Aq are (1,1). Moving to (1,0), we note that we 
are done, and we thus output a complete rule: 

(rl)5(l,2)^Ao(l,l) ^i(l,2). 

We then encounter (2,12) on the queue: (r = 

2, w = 0,i = l,p = Ai ^ U,j = 2,H = 
{sl/11}), which can be output with no further 
ado: 

(r2) ^i(l,2) ^1 

Moving to item (2,11) (r = A,w = 

0. 1 = l^p = Ai ^ Ai»,j = 
2,H = {c2/12; 1/16, c2/ll; 1/16, c2/10; 1/16}) 
we don't need to queue anything, and we can 
see that the output will be: 

(r4) Ai(l,2) ^0Ai(l,2) 

Item (2,10) is (r = 5,w = 0,i = 

1, p = Ai ^ 1 Ai;j = 2,H = 
{c2/12; 1/15, c2/ll; 1/15, c2/10; 1/15}), so we 
output 

(r5) Ai(l, 2)^1^1(1,2) 

We now move on to item (1,7): (r = 3,w = 

0. 1 = l,p = Ao ^ 0.,j = 1,H = {sl/2}), 
which we output as 

(r3) ^o(l, 1) ^ 0. 

Item (1,10) is (r = A,w = 0, i = 

1, p = Ao ^ Ao;j = 1,H = 
{cl/7; 1/6, cl/10; 1/6, cl/9; 1/6, cl/22; 1/6}). 
In dealing with this we need to queue nothing, 
and we output: 

(r4) ^o(l,l) ^OAo(l,l) 

Moving to (1,9), which is (r = 5,w = 
0,1 = l,p = Ao 1 Ao;j = 1,H = 



{cl/7; 1/5, cl/10; 1/5, cl/9; 1/5, cl/22; 1/5}), 
we queue nothing and output 

(r5) ^o(l,l) ^l^o(l,l) 

Finally we get to (1,22): (r = 2,w = 0,i = 
l,p = ^0 — !•) j = 1,H = {sl/1}), which gets 
output as 

(r2) Ao(l,l)^l 

Collecting these together (for category A), we 
get the following pairings: 




^o(l,l) 
^o(l,l) 
A) (1,1) 
^o(l,l) 



1 



OAo(l, 1) 
lAo(l,l) 



^1(1,2) 

Ai(l,2) 
Ai(l,2) 



Ai(l,2) 
l^l(l,2) 



Note that the "pair" for (r3) has no second 
member, so nothing will be output for it. Com- 
bining the compatible rules, we get the following 
grammar: 

5(1,2) ^ A(l,l)(l,2)o A(l,l)(l,2)i 
A(l,l)(l,2) ^ (1,1) 

I (0^o(l,l)(l,2),OAi(l,l)(l,2)) 
I (lAo(l,l)(l,2),l Ai(l,l)(l,2)) 

which is equivalent to the grammar: 



S 
A 



Ao Ai 

(1,1) 

(0 Ao,OA,] 

(1^0,1^1) 



This grammar indeed represents the best out- 
puts from the intersection — all reduplicating 
forms which end in a 1. 
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